The Effects of an Arcsin Square Root Transform on a Binomial Distributed Quantity
نویسنده
چکیده
This document provides proofs of the following: • The binomial distribution can be approximated with a Gaussian distribution at large values of N . • The arcsin square-root transform is the variance stabilising transform for the binomial distribution. • The Gaussian approximation for the binomial distribution is more accurate at smaller values of N after the variance stabilising transform has been applied. The conclusion contains some comments concerning the relationship between the variance stabilising transform and the improved accuracy of the Gaussian approximation, which holds for both the binomial and Poisson distributions. 1 Gaussian Approximation to the Binomial Distribution The binomial distribution P (n|N) = N ! n!(N − n)! nq(N−n) (1) gives the probability of obtaining n successes out of N Bernoulli trials, where p is the probability of success and q = 1− p is the probability of failure. The shape of the distribution approaches that of a Gaussian distribution at large N as a consequence of the central limit theorem. In order to demonstrate this, it is necessary to expand the distribution as a Taylor series around the maximum. First, let n′ be the position of the maximum of P (n), giving n = n′+η, and rather than expanding the distribution itself, expand the natural logarithm of the distribution. Expanding as a Taylor series f(a+ h) = f(a) + hf ′(a) + h 2! f ′′(a) + ...+ h(n−1) (n− 1)! (n−1)(a) + ... (2) gives ln[P (n′ + η)] = ln[P (n′)] + ηB1 + η 2! B2 + η 3! B3... (3) where Bk = ∣
منابع مشابه
برآورد فاصله اطمینان برای نسبتهای نزدیک به صفر و یک: یک مطالعه ثانویه مدل سازی
Background and Objectives: When computing a confidence interval for a binomial proportion p, one must choose an exact interval that has a coverage probability of at least 1-α for all values of p. In this study, we compared the confidence intervals of Clopper-Pearson, Wald, Wilson, and double ArcSin transformation in terms of maintaining a constant nominal type I error. Methods: Simulations w...
متن کاملZero-inflated negative binomial modeling, efficiency for analysis of length of maternity hospitalization
Background: Mothers’ delivery is one of the most common hospitalization factors throughout the world and it’s modeling can explain distribution and effective factors on rising and decreasing of it. The objective of the present study was a suitable modeling for mother hospitalization time and comparing it with different models. Materials & Methods: Present study is an observational and cross-s...
متن کاملInteger Powers of Arcsin
New simple nested sum representations for powers of the arcsin function are given. This generalization of Ramanujan’s work makes connections to finite binomial sums and polylogarithms. AMS Subject Classification: 33B10 (30B99, 40B05)
متن کاملReliability of environmental sampling culture results using the negative binomial intraclass correlation coefficient
The Intraclass Correlation Coefficient (ICC) is commonly used to estimate the similarity between quantitative measures obtained from different sources. Overdispersed data is traditionally transformed so that linear mixed model (LMM) based ICC can be estimated. A common transformation used is the natural logarithm. The reliability of environmental sampling of fecal slurry on freestall pens has b...
متن کاملBeta - Binomial and Ordinal Joint Model with Random Effects for Analyzing Mixed Longitudinal Responses
The analysis of discrete mixed responses is an important statistical issue in various sciences. Ordinal and overdispersed binomial variables are discrete. Overdispersed binomial data are a sum of correlated Bernoulli experiments with equal success probabilities. In this paper, a joint model with random effects is proposed for analyzing mixed overdispersed binomial and ordinal longitudinal respo...
متن کامل